Multiple Imputation in a Complex Sample Survey

نویسنده

  • Eugene M. Burns
چکیده

Multiple imputation for missing survey data is relatively new concept. As defined by one of its leading proponents, "multiple imputation is the technique that replaces each missing or deficient value with two or more acceptable values representing a distribution of possibilities" (Rubin 1987, p.2). Multiply-imputed data reflects the uncertainty contained in the imputation process in a way not possible with singly-imputed data. Incorporating multiple imputation into a survey, both as an imputation method and in subsequent estimation, does present some practical problems. So far, most discussions of multiple imputation have been limited to surveys with some relatively tractable sample design (or have disregarded the sample design (Oh and Scheuren 1980)). However, complex sample surveys are another important set of surveys. In addition to the need for large-scale imputation for many missing items, complex sample surveys are characterized by features such as multistage stratified cluster sampling (possibly listsupplemented), reweighting for unit nonresponse, and possible post-stratification adjustments. Incorporating multiple imputation into such a survey would seem to be a formidable task. This paper describes proposed procedures, and presents results, for incorporating multiple imputation into one complex sample survey, the Energy Information Administration's Commercial Buildings Energy Consumption Survey (CBECS) (Energy Information Administration 1988, 1989). The CBECS, conducted triennially, has a

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Introduction to Multiple Imputation of Complex Sample Data using SAS ®

This paper presents practical guidance on the proper use of multiple imputation tools in SAS® 9.2 and the subsequent analysis of multiple imputed data sets from a complex sample survey data set. Use of the MI and MIANALYZE procedures and SAS survey procedures for typical descriptive and inferential analyses is demonstrated. The analytic techniques presented can be used on any operating system a...

متن کامل

An Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods

Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...

متن کامل

Inference with Survey Data Imputed by Hot Deck When Nonrespondents Are Nonidentifiable

Hot deck imputation for nonrespondents is often used in surveys. It is a common practice to treat the imputed values as if they are true values, and compute survey estimators and their variance estimators using standard formulas. The variance estimators, however, have seriously negative biases when the rate of nonresponse is appreciable. Methods such as the multiple imputation and the adjusted ...

متن کامل

Title Sensitivity of Econometric Estimates to Item Non - response Adjustment

Non-response in establishment surveys is a very important problem that can bias results of statistical analysis. The bias can be considerable when the survey data is used to do multivariate analysis that involve several variables with different response rates, which can reduce the effective sample size considerably. Fixing the non-response, however, could potentially cause other econometric pro...

متن کامل

Multiple Imputation Using the Fully Conditional Specification Method: A Comparison of SAS®, Stata, IVEware, and R

This presentation emphasizes use of SAS 9.4 to perform multiple imputation of missing data using the PROC MI Fully Conditional Specification (FCS) method with subsequent analysis using PROC SURVEYLOGISTIC and PROC MIANALYZE. The data set used is based on a complex sample design. Therefore, the examples correctly incorporate the complex sample features and weights. The demonstration is then repe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002